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Abstract 

N 

This  is  an  expository  survey  of  the  (univariate)  theory  of  extreme 
values  and  the  estimation  of  the  parameters  of  the  related  extreme- 
value  distributions.  This  survey  was  written  at  the  invitation  of 
Professors  N.L.  Johnson  and  S.  Kotz,  editors  of  the  forthcoming 
Encyclopedia  of  Statistical  Sciences  to  be  published  by  John  Wiley 
and  Sons,  Inc.,  New  York.  The  theory  of  extreme  values  plays  a 
fundamental  role  in  several  areas  of  applied  statistics,  such  as 
the  analysis  of  flood  flows,  the  reliability  of  complex  systems, 
the  analysis  of  air  pollution  data,  etc.  In  addition  to  surveying 
(without  proofs)  the  basic  results  of  extreme  value  theory  and  the 
estimators  of  the  parameters  of  the  extreme  value  distributions, 
this  survey  presents  a  brief  discussion  of  the  current  research  in 
these  areas. 
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The  theory  of  extreme  values,  and  the  extreme  value  distributions, 
play  an  important  role  in  theoretical  and  applied  statistics.  For 
example,  extreme  value  distributions  arise  quite  naturally  in  the 
study  of  size  effect  on  material  strengths,  the  occurrence  of  floods 
and  droughts,  the  reliability  of  systems  made  up  of  a  large  number  of 
components,  and  in  assessing  the  levels  of  air  pollution.  Other 
applications  of  extreme  value  distributions  arise  in  the  study  of  what 
are  known  as  "record  values"  and  "breaking  records."  For  an  up-to-date 
and  a  fairly  complete  reference  on  the  theory  of  extreme  values,  we 
refer  the  reader  to  the  recent  book  by  Galambos  (1978).  For  a  more 
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classical  yet  honorable  treatise  on  the  subject,  we  refer  to  Gumbel 
(1958). 

1 .  Preliminaries 

Suppose  that  ^ ,X2,**»,Xn  are  ^ePen<^ent  ar*d  identically 
distributed  random  variables  from  a  distribution  F(x)  which  is 
assumed  to  be  continuous.  The  theory  of  extreme  values  primarily 
concerns  itself  with  the  distribution  of  the  smallest  and  largest 
values  of  XrX2,**«,Xn.  That  Is,  If 

Xl:n  3  min(x1»x2»,-»xn)  3  X(l)  (K1) 


and 


X 


n:n 


max(X^ ,X2,*»*,Xn)  *  X(n)  * 


(1.2) 


then  knowing  F(x),  we  would  like  to  say  something  about 


Ln(x) 


Pr[) 


X(l)  and  X(n) 


(1 )  <  X 

are  a 


and  Hn(x) 


Pr[x(n)<x]. 


The  random  variables 


so  known  as  the  extreme  values. 

In  order  to  give  some  motivation  as  to  why  the  random  variables, 
Xp)  and  X^nj,  and  their  distribution  functions  are  of  interest  to  us, 
we  shall  consider  the  following  situations: 


1.  Consider  a  chain  which  is  made  up  of  n  links;  the  chain 
breaks  when  any  one  of  its  links  break.  The  first  link 
to  break  Is  the  weakest  link;  that  is,  the  one  which  has 
the  smallest  strength.  It  Is  meaningful  to  assume  that 
the  strength  of  the  ith  link,  say  X^,  1«l,2,»»*,n  is  a 
random  variable  with  distribution  function  F(x).  Since 
the  chain  breaks  when  Its  weakest  link  fails,  the  strength 
of  the  chain  is  therefore  described  by  the  random  variable 
X(1)  =  min(X1,X2,---,Xn). 
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2.  Consider  an  engineering  or  a  biological  system  which  is 

made  up  of  n  identical  components,  all  of  which  may  function 
simultaneously.  For  example,  a  large  airplane  may  contain 
four  identical  engines  which  could  be  functioning  simultane¬ 
ously,  or  the  human  respiratory  system  which  consists  of 
two  identical  lungs.  The  system  functions  as  long  as  any 
one  of  the  n  components  is  functioning.  Such  systems  are 
known  as  parallel-xvdundant  systems  and  occur  quite  often 
in  practice.  Suppose  that  the  time  to  failure  (the  lifelength) 
of  the  1th  component,  say  X^,  i*l,2,***,n  is  a  random 
variable  with  distribution  function  F(x).  Since  the  system 
fails  at  the  time  of  failure  of  the  last  component,  the 
lifelength  of  the  system  is  described  by  the  random  variable 
X(n)  *  max(X.j jXg,*** ,Xn). 

It  is  easy  to  envision  several  other  physical  situations  in  which 
the  random  variables  X^  and  X^  arise  quite  naturally.  For  instance, 
the  use  of  X^  for  setting  air  pollution  standards  Is  discussed  by 
Slngpurwalla  (1972)  and  by  Mittal  (1978);  and  the  use  of  X^j  in 
studying  the  time  for  a  liquid  to  corrode  through  a  surface  having 
a  large  number  of  small  pits  is  discussed  in  Mann,  Schafer,  and 
Singpurwalla  (1974),  p.  130. 

2.  Distribution  of  the  Extreme  Values 

Even  though  our  assumption  that  X-itX*,***^  are  independent  is 
hard  to  justify  In  practice,  we  shall,  in  the  Interest  of  simplicity 
and  an  easier  exposition,  continue  to  retain  It.  Note  that 

Ln(x)  ’  pr[xn>  <x]  * 1  - Pr[x(n  >x]  . 

-  1  -  Pr[x1>x,X2>x,***,Xn>x]  , 
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since  the  probability  that  the  smallest  value  is  larger  than  x  is  the 
same  as  the  probability  that  all  the  n  observations  exceed  x.  Because 
of  independence 


Ln(x)  =  1  -  ft  Prfx.  >xl  =  1  -  (1  -  F(x))n  , 
n  i*!  L  1  J 


(2.1) 


since  all  the  n  observations  have  a  coimion  distribution  F(x).  Using 
analogous  arguments  we  can  show  that 


Hn(x)  *  P»*[X(n)  <  x]  *  (F(x))n  . 


(2.2) 


Thus  under  independence,  when  F(x)  is  completely  specified,  we  can. 


in  principle,  find  the  distribution  of  Xn»  and  X/ 


Often  the 


distribution  functions,  t-n(x)  and  Hn(x),  take  simple  forms.  For 
example,  if  F(x)  is  an  exponential  distribution  with  a  scale  parameter 


X  >0,  that  is,  if  F(x)  =  1  -  e  ,  x  >0,  then  ln(x)  *  1  -  e",,A* 

—  again  an  exponential  distribution  with  a  scale  parameter  nX. 

Despite  the  simplicity  of  the  above  results,  there  are  two 
considerations  which  motivate  us  to  going  beyond  Equations  (2.1)  and 
(2.2).  The  first  consideration  pertains  to  the  fact  that  in  many 
cases  Ln(x)  or  Hn(x)  do  not  take  simple  and  manageable  forms,  and  the 
second  consideration  is  motivated  by  the  fact  that  in  many  practical 
applications  of  the  extreme  value  theory  n  is  very  large.  For  example, 
if  F(x)  =  1  -  e‘*x,  then  Hn(x)  =  (l  -  e"*x)n,  and  when  F(x)  is  the 
distribution  function  of  a  standard  normal  variate,  then 


H„(x) 


so  happens  that  under  some  very 


general  conditions  on  F(x),  the  distributions  of  X. 


and  X^  when 


n  becomes  large  take  simple  forms.  The  distributions  Ln(x)  and  Hn(x), 
when  n-*>»,  are  known  as  the  asymptotic  (or  the  limiting)  distribution 
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of  extreme  values,  and  the  associated  theory  which  enables  us  to  study 
these  is  known  as  the  asymptotic  theory  of  extremes',  the  word  "asymptotic" 
describes  the  fact  that  n  is  getting  large. 


2.1  The  Asymptotic  Distribution  of  Extremes 

The  key  notion  which  makes  the  asymptotic  distributions  of 
and  X^nj  of  interest  is  that  for  some  constants  on>  0n  >  0,  yn,  and 
5n  >  0,  the  quantities  (X^  -  an)/Bn  and  (*(n)  “  yn)/<5n  l)ecome  more 
and  more  independent  of  n.  The  «n,  8n,  yn»  and  <$n  are  referred  to  as 
the  normalizing  constants.  A  goal  of  the  asymptotic  theory  of  extreme 
values  is  to  specify  the  conditions  under  which  the  normalizing 
constants  exist,  and  to  determine  the  limiting  distribution  functions 
L(x)  and  H(x)  where 


and 


1  im  Pr 
n-*-® 


hrH 


1  im  Pr 
n  -*■“ 


max[min](X1 ,X2,***,Xn) 


*  lim  Ln(an  +  Snx)  =  L(x) 


*  lim  Hn(yn  +  6nx)  =  H(x)  . 
n  °° 

=  -min[max](-X1 ,-X2,***,-Xn)  , 


(2.3) 


(2.4) 

(2.5) 


the  theory  for  the  largest  extreme  is  identical  to  the  theory  for  the 
smallest  extreme  and  vice  versa.  However,  we  shall,  for  the  sake  of 
completeness,  give  the  pertinent  results  for  both  the  maxima  and 
the  minima. 
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The  fundamental  result  in  the  theory  of  extreme  values  was 
discovered  by  Frechet,  and  by  Fisher  and  Tippet  in  1928,  and  was 
formalized  in  1943  by  Gnedenko.  It  states  that  if  (x(n)  "  Yn)/<5n 
has  a  limiting  distribution  H(x),  then  H(x)  must  have  one  of  the 
three  possible  forms.  An  analogous  result  also  holds  for  (X^j  ““n^n* 
The  immediate  implication  of  this  result  is  that  irrespective  of  what 
the  original  distribution  F  is,  the  asymptotic  distribution  of 
(if  it  exists)  is  any  one  of  three  possible  forms.  Thus,  the  asymptotic 
distribution  of  the  extreme  values  is  in  some  sense  akin  to  the  normal 
distribution  for  the  sample  mean.  This  property  of  the  asymptotic 
distribution  of  the  extremes  is  another  motivation  for  our  study  of 
the  limiting  distributions. 

We  shall  suitmarize  the  above  results  via  the  following  theorem 
of  Gnedenko. 

Theorem  2.1  (Gnedenko):  Let  X-j **2’**’’*n  be  1 ndependent  and 
identically  distributed  with  distribution  function  F,  and  let 
*  max(X.|  ,X2,***,Xn).  Suppose  that  for  some  sequences  of 
normalizing  constants  {vn),  and  {<$n  >  0),  and  some  other  constants 
a  >  0,  b  >  0 


lim 
n  -*■  ® 


v 


x  -  a 
b 


(2.6) 


for  all  continuity  points  of  x,  where  H(*)  is  a  nondegenerate  distri¬ 
bution  function.  Then,  H(*)  must  belong  to  one  of  the  following  three 
"extreme  value  types": 


I(largest)  H 


>«o<x<«  ,  (2.7) 
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II  (largest) 


,  x<a 


(W)  • 


(2.8) 


x>a  ,  a>0  , 


Ill(largest)  = 


■(•(■  vr)  • 


x<a  ,  a>0 


(2.9) 


,  x>a 


Whenever  Equation  (2.7),  or  (2.8),  or  (2.9)  holds  for  some  sequences 
{y  >  and  {6„  >0},  we  shall  say  that  F  belongs  to  the  domain  of 

0  0  /  4  \  (  4  \ 

attraction  of  H'  ' ,  i*l,2,  or  3,  and  write  FS2(HV  ').  Furthermore, 
it  is  not  necessary  for  us  to  know  the  exact  form  of  F  in  order  to 
determine  to  which  domain  of  attraction  it  belongs.  A  useful  feature 
of  the  extreme  value  theory  is  that  it  is  just  the  behavior  of  the  tail 
of  F(x)  that  determines  its  domain  of  attraction.  Thus,  a  good  deal 
can  be  said  about  the  asymptotic  behavior  of  based  on  a  limited 
knowledge  about  F.  We  shall  formalize  the  above  facts  by  giving  below 
the  necessary  and  sufficient  conditions  for  F€S>(H^),  i=l,2,3. 

Theorem  2.2  (Gnedenko):  Let  xQ  <“  be  such  that  F(xQ)  *  1,  and 
F(x)  <  1  for  all  x  <  xQ.  Then 

a)  F€®(H^)  if  and  only  if  there  exists  a  continuous  function 
A(x)  such  that  lim  A(x)  3  0,  and  such  that  for  all  h, 

x*xo 

11m  l-WMW)).,-*  ■ 

X-XQ  1 -rW 

b)  F€0(h^)  if  and  only  if  lim  |  =  k°  for  each 

k>0,  and  a>0; 
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/-j\  1  -  F(x  -  kh) 

c)  F€0(H^J')  if  and  only  if  lim  ^  -  K)  =  ^  ^or  eac>1 

k>0,  and  o>0. 


We  note  from  the  above  theorem,  the  role  played  by  xQ,  the  tail 
point  of  F;  that  is,  the  point  where  F=l. 

Using  the  criteria  given  in  Theorem  2.2,  we  can  verify  that  if  F 
is  either  an  exponential,  or  a  normal,  or  a  Weibull  distribution 
(F(x)  =  l  -exp(-x°),  x>0,  a>0),  then  F  belongs  to  the  domain  of 
attraction  of  whereas  if  F  is  a  uniform  distribution,  then 
FS2>(h^);  this  conclusion  of  course  is  for  the  largest  values. 

Another  property  exhibited  by  the  extreme  value  type  distributions 

ial,2,3,  is  that  they  belong  to  their  own  domain  of  attraction. 
That  is,  H^€0(H^),  for  i=l,2,  or  3;  this  is  also  referred  to  as 
the  8 elf- locking  property. 

Methods  for  determining  the  constants  y„  and  <5„  involve  some 
additional  notation  and  detail,  and  these  can  be  found  in  Gnedenko 
(1943)  or  Calambos  (1978). 

Analogous  to  the  three  "extreme  value  types"  for  the  largest 
values  given  in  Theorem  2.1,  we  have  three  extreme  value  types  for 
the  smallest  values  X^.  That  is,  if 

lim  Pr  |  =  L(*"b a)  »  then  U*)  must  belong  to  one 

of  the  following: 


I(smallest)  *  1  -  exp(-exp(^-r-M)  , 


<x<»  ,  (2.10) 
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II (smallest) 


III (smallest) 


0 

1  -  exp 


,  x<a 


(■m . 


(2.11) 


1  -exp 
1 


«•  H*n  ■ 


x>a  ,  a>0  , 


x<a  ,  a>0  , 


(2.12) 


,  x>a  . 


Using  criteria  which  are  analogous  to  Theorem  2.2,  we  can  verify 
that  if  F  is  a  normal  distribution,  then  F€0(L^),  whereas  if  F  is 
an  exponential,  a  uniform,  or  a  Weibull,  then  F€£2(|_^).  Here  again, 
the  distributions  are  self-locking.  By  way  of  a  comnent,  we  note 
that  L'^(«)  is  in  fact  the  Weibull  distribution  which  was  mentioned 
before  and  which  is  quite  popular  in  reliability  theory. 

Current  research  in  extreme  value  theory  is  being  vigorously 
pursued  from  the  point  of  view  of  dropping  the  assumption  of  independence 
and  considering  dependent  sequences  x-|  »X2’“*,*n’  One  widely  used  class 
of  dependent  random  variables  is  the  exchangeable  one. 

Definition  [Galambos  (1978),  p.  127]:  The  random  variables 

*1 ’*2* *** ’*n  are  said  to  be  exchangeable  if  the  distribution  of  the 

vector  (X..  ,Xi  .•••.X,.  )  is  identical  to  that  of  (X1,**»,Xn)  for  all 
12  n 

permutations  (i-j the  subscn*Pts  0,2, •••»n). 

Generalizations  of  Gnedenko's  results  when  the  sequence  X^ ,Xn 
is  exchangeable  are  given  in  Chapter  3  of  Galambos  (1978).  For  an 
excellent  and  a  very  readable,  albeit  mathematical,  survey  of  results 
when  the  sequence  X-|,>*»,Xn  is  dependent,  we  refer  the  reader  to 
Leadbetter  (1975). 

Another  aspect  of  the  current  research  in  extreme  value  theory 
pertains  to  multivariate  extreme  value  distributions.  An  entry  on 
"Multivariate  Extreme  Value  Distributions"  appears  in  the  forthcoming 
Encyclopedia  of  Statistical  Sciences. 
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3.  Estimation  of  the  Parameters  of  the  Asymptotic  Distributions 


In  order  for  us  to  discuss  methods  for  estimating  the  parameters 
a,  b,  and  a  of  the  distributions  and  the  L^,  i=l,2,3,  it  will 
be  helpful  if  we  recognize  several  relationships  which  exist  between 
them. 

For  example,  if  we  denote  the  asymptotic  distribution  of 
*(n)’  \X b  /  ^  H^(a,b),  and  the  asymptotic  distribution  of 

X^j,  ^ aj  by  L^(a,b),  then  it  can  be  verified  that 

^2^-X^,  has  the  distribution  L^(-a,b).  We  shall  denote  the 

above  relationship  by  writing  "  H^(a,b) — ^(-a,b) .  "  In  a 
similar  manner,  if  we  denote  ^x-j- -  j  and  by  H^(a,b,a)  and 

L^(a,b,a)  respectively,  for  i*2,3,  then  H^(a,b,a) — llli.L^3^(-a,b,a)  and 
-X 

H^3^(a,b,a) — l^L^(-a,b,a).  If,  however,  Y^  S^X^,  and  the 


#-i 


location  parameter  a  =  0,  then  H^(0,b,a)  — lULl^^O.b"1  ,a)  and 


x*  i 

H^(0,b,a)  (n)>L^)(Q>b-t>ct)>  other  transformations  that  are  of 

-X/ 

interest  are  Y 


H(1)(a,b)- 


-X 


(n) 


k(n) 


and  Y^i)sJln  Xm ;  these  give  us 


V) 


ill<2)(0,e-4.b->)  and  L<2>(0,b,cO  b.„->. 


If  we  suppress  the  arguments  of  the  and  the  L^,  i*l  ,2,3, 
then  the  following  illustration,  suggested  to  us  by  Mr.  M.Y.  Wong, 
is  a  convenient  summary  of  the  above  relationships. 

It  is  easy  to  verify  that  in  the  following  illustration  the  reverse 
relationships  also  hold.  For  example,  if 
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In  view  of  this  last  relationship,  and  the  relationships  implied  by 
the  illustration  given  above,  it  follows  that  we  need  only  consider 
the  distribution  l^(a,b).  All  the  other  distributions  considered 
here  can  be  transformed  to  the  distribution  L^(a,b),  either  by  a 
change  of  variable  or  by  a  change  of  variable  with  a  setting  of  the 
location  parameter  equal  to  zero.  It  is  because  of  this  fact  that 
some  of  the  literature  on  the  Weibull  distribution  with  a  location 
parameter  of  0  (l^(0,b,a))  appears  under  the  heading  of  "an  extreme 
value  distribution"  which  Is  a  common  way  of  referring  to  the 
distribution 

When  the  location  parameter  a  associated  with  the  distributions 

and  1*2,3,  cannot  be  set  equal  to  zero,  most  of  the 

relationships  mentioned  before  do  not  hold,  and  thus  we  cannot  be 
content  by  just  considering  the  distribution  L^(a,b).  We  will  have 
to  consider  both  H^(a,b,a)  and  H^(a,b,a)  or  their  duals  L^(-a,b,a) 
and  L^(-a,b,a),  respectively.  Estimation  of  the  parameters  a 
(or  -a),  b,  and  a  is  discussed  in  the  next  section. 
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3.1  Estimation  for  the  Three-Parameter  Distributions 

The  standard  approach  for  estimating  the  three  parameters 
associated  with  H'^  and  L'  ,  1»2,3,  is  the  one  based  on  the  method 
of  maximum  likelihood.  Because  of  the  popularity  of  the  Wei  bull 
distribution,  the  case  L^(a,b,a)  has  been  investigated  very  extensively. 
We  shall  give  below  an  outline  of  the  results  for  this  case,  and  guide 
the  reader  to  the  relevant  references. 

Let  <x(2)  ^  *”  ^  X(n)  1)6  the  sma^est  ordered  observations 

in  a  sample  of  size  n  from  the  distribution  L^(a,b,a).  Harter  and 
Moore  (1965),  and  also  Mann,  Schafer,  and  Singpurwalla  (1974),  p.  186, 

(to  be  henceforth  abbreviated  as  MSS),  give  the  three  likelihood 
equations  and  suggest  procedures  for  an  iterative  solution  of  these. 

They  also  give  suggestions  for  dealing  with  problems  which  arise  when 
the  likelihood  function  increases  monotonically  in  (O.X^j). 

Lemon  (1974)  modified  the  likelihood  equations  so  that  one  need 
iteratively  solve  only  two  equations  for  estimates  of  the  location 
parameter  a  and  the  shape  parameter  a,  which  then  specify  an  estimate 
of  the  scale  parameter  b. 

MSS  discuss,  as  well,  the  graphical  method  of  estimation,  quick 
initial  estimates  proposed  by  Dubey  (1966),  and  iterative  procedures 
involving  linear  estimates  as  leading  to  a  median  unbiased  estimate 
of  a.  (A  recent  result  of  Somerville  (1977)  suggests  that  in  iteratively 
obtaining  a  median  unbiased  estimate  of  a  Weibull  location  parameter, 
k,  defined  at  the  bottom  of  p.  341  in  MSS,  should  be  approximately  k/5.) 

Rockette,  Antle,  and  Klimko  (1974)  have  conjectured  that  there 
are  never  more  than  two  solutions  to  the  likelihood  equations.  They 
show  that  If  there  exists  a  solution  that  is  a  local  maximum,  there 
Is  a  second  solution  that  Is  a  saddle  point.  They  also  show  that,  even 
if  a  solution  (a, 6, a)  is  a  local  maximum,  the  value  of  the  likelihood 


i 


i 
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function  L(a,b,a)  may  be  less  than  L(a0,b0,aQ)  where  aQ 


and  aQ  =  1,  and  bQ 


'(1)’ 


maximum  likelihood  estimate  of  the  mean  of  a 
two-parameter  exponential  distribution. 


3.2  Estimation  for  the  Two-Parameter  Distributions  j 

1 

When  the  location  parameter  a  associated  with  the  distributions  j 

and  L^,  1=2,3,  is  known,  or  can  be  set  equal  to  zero,  then  there  j 

are  several  approaches  that  can  be  used  to  obtain  good  point  estimators  \ 

of  the  parameters  b  and  a.  The  same  is  also  true  when  we  are  Interested  ’ 

in  the  parameters  a  and  b  of  and  L'^.  These  approaches  involve  ] 

an  iterative  solution  of  the  maximum  likelihood  equations,  and  the 
use  of  linear  estimation  techniques.  j 


3.2.1  Maximum  Likelihood  Estimation 

The  maximum  likelihood  method  has  the  advantage  that  it  can  be 
applied  efficiently  to  any  sort  of  censoring  of  the  data. 

For  all  the  extreme-value  distributions,  the  order  statistics  are 
the  sufficient  statistics.  Thus,  unless  there  are  only  two  observations, 
the  sufficient  statistics  are  not  complete  and  no  small -sample  optimality 
properties  hold  for  the  maximum  likelihood  estimators.  The  maximum 
likelihood  estimators  of  the  two  parameters  are,  however,  asymptotically 
unbiased  as  well  as  asymptotically  normal  and  asymptotically  efficient. 
One  can  use  the  maximum  likelihood  estimates  with  tables  of  Thoman, 

Bain,  and  Antle  (1970)  and  of  Billman,  Antle,  and  Bain  (1971)  to 
obtain  confidence  bounds  on  the  parameters. 


3.2.2  Linear  Estimation  Techniques 

Linear  techniques  allow  for  the  estimation  of  the  two  parameters 
of  Interest  without  the  necessity  of  iteration.  See  MSS  pp.  191-220 
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and  the  entries  Welbull  distribution,  beat  linear  invariant  (BLI) 
estimators  and  best  linear  unbiased  (BLU)  estimators.  Tables  of  Mann, 
Fertlg,  and  Scheuer  (1971)  and  Mann  and  Fertig  (1973)  can  be  used  with 
either  the  BLI  or  BLU  estimates  to  obtain  confidence  and  tolerance 
bounds  for  censored  samples  of  size  n,  n«3(l)25.  See  also  MSS,  p.  222, 
for  tables  with  n*3(l)13.  Thomas  and  Wilson  (1972)  compare  the  BLU 
and  BLI  estimators  with  other  approximately  optimal  linear  estimators 
based  on  all  the  order  statistics. 

If  samples  are  complete  and  sample  sizes  are  rather  large,  one 
can  use  tables  of  Chan  and  Kablr  (1969)  or  of  Hassanein  (1972)  to 
obtain  linear  estimates  of  a  and  b  based  on  from  2  through  10  order 
statistics.  These  tables  apply  to  weights  and  spacings  for  the  order 
statistics  that  define  estimators  that  are  asymptotically  unbiased 
with  asymptotically  smallest  variance.  Hassanein's  results  have  the 
restriction  that  the  spacings  are  the  same  for  both  estimators,  but 
he  also  considers  samples  with  10  percent  censoring.  Tables  of  Mann 
and  Fertig  (1977)  allow  for  removal  of  small-sample  bias  from  Hassanein's 
estimators  and  give  exact  variances  and  covariances.  This  enables  one 
to  calculate  approximate  confidence  bounds  from  these  estimators. 

For  samples  having  only  the  first  r  of  n  possible  observations, 
the  unbiased  linear  estimator  of  Engelhardt  and  Bain  (1973)  for  the 

r 


parameter  b,  b** 
r,n 


,?iix<s> 


X ( i ) I ( nkr  n^”1  very  e^cient»  especially 


for  heavy  censoring.  To  obtain  b**n,  one  need  only  know  a  tabulated 

r»n 

value  of  kr  and  an  appropriate  value  for  s;  s  is  a  function  of  r 
and  n. 

A  corresponding  estimator  for  a  is  then  given  by 
a"n  *  x(s) '  E(zs)b”n  ,,her'  zs  *  <x(s)  •  a)/b- 
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MSS,  pp.  208-214,  241-252,  give  tables  and  references  to  additional 

tables  for  using  these  estimators.  More  recent  references  that  aid  in 

the  use  of  these  estimators  are  in  Engelhardt  (1975). 

The  estimators  b**„  and  a**  approximate  the  BLU  estimators  and 
r,n  r*n 

can  be  converted  easily  to  approximations  to  the  BLI  estimators,  which 
in  turn  approximate  results  obtained  by  maximum  likelihood  procedures. 

The  estimator  b£*n  has  the  property  that  2b£*n/var(b£*n/b)  is  very 

nearly  a  chi-squared  variate  with  2/var(b**  /b)  degrees  of  freedom.  This 

r  »n 

property  holds  for  any  efficient  unbiased  estimator  of  b,  including  a 
maximum  likelihood  estimator  corrected  for  bias.  Because  the  BLI 
estimators  so  closely  approximate  the  maximum  likelihood  estimators  of 
b,  tables  yielding  biases  for  the  BLI  estimators  can  be  used  to  correct 
the  maximum  likelihood  estimators  for  bias. 

The  fact  that  unbiased  estimators  of  b  are  approximately  chi-squared 
variates  has  been  used  to  find  approximations  to  the  sampling  distributions 
of  functions  of  estimators  of  a  and  other  distribution  percentiles. 

MSS  describe  an  F-approx1mat1on  that  can  be  used  with  complete  samples 
to  obtain  confidence  bounds  on  very  high  (above  or  below  90  percent), 
or  very  low  distribution  percentiles,  or  with  highly  censored  data  to 
obtain  a  confidence  bound  for  a.  The  precision  of  this  approximation 
is  discussed  by  Lawless  (1975)  and  Mann  (1977,  1978).  Engelhardt  and 
8ain  (1977)  have  suggested  the  use  of  a  Jin  x2  approximation,  the  regions 
of  utility  of  which  tend  to  complement  those  of  the  F-approximation. 

Lawless  (1978)  reviews  methods  for  constructing  confidence  Intervals 
or  other  characteristics  of  the  Weibull  or  extreme-value  distribution. 
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